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It is generally believed that, in the thermodynamic limit, the microcanonical description as a 
function of energy coincides with the canonical description as a function of temperatnre. However, 
various examples of systems for which the microcanonical and canonical ensembles are not equivalent 
have been identified. A complete theory of this intriguing phenomenon is still missing. Here we show 
that ensemble nonequivalence can manifest itself also in random graphs with topological constraints. 
We find that, while graphs with a given number of links are ensemble-equivalent, graphs with a given 
degree sequence are not. This result holds irrespective of whether the energy is nonadditive (as in 
unipartite graphs) or additive (as in bipartite graphs). In contrast with previous expectations, our 
results show that: (1) physically, nonequivalence can be induced by an extensive number of local 
constraints, and not necessarily by long-range interactions or nonadditivity; (2) mathematically, 
nonquivalence is determined by a different large-deviation behaviour of microcanonical and canonical 
probabilities for a single microstate, and not necessarily for almost all microstates. The latter 
criterion, which is entirely local, is not restricted to networks and holds in general. 
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Background. In statistical physics, calculating the equi¬ 
librium properties of a system with a given energy re¬ 
quires averaging over the so-called microcanonical ensem¬ 
ble di , i.e., the uniform distribution on the set of all 
particle configurations having a prescribed energy. Apart 
from trivial examples, this is a mathematically challeng¬ 
ing task. Moreover, it is difficult to physically realize 
a situation where there is no uncertainty in the energy 
of the system. Therefore, it is often preferable to work 
with the so-called canonical ensemble 0, i.e., a proba¬ 
bility distribution with maximal entropy on an extended 
set of configurations that ‘violate’ the desired energy, 
but in such a way that the average energy matches the 
prescribed value. This is achieved through an appropri¬ 
ate temperature, mathematically arising as the Lagrange 
multiplier enforcing the prescribed average energy. 

Starting with the work of Gibbs Q , the microcanonical 
and canonical ensembles have been shown to be equiva¬ 
lent in the thermodynamic limit (i.e., when the number 
of particles in the system tends to infinity) for physical 
systems with short-range interactions. The original argu¬ 
ment is that in the canonical ensemble at hxed tempera¬ 
ture the energy fluctuations are negligible with respect to 
the average energy, so that in the thermodynamic limit 
the canonical ensemble is effectively microcanonical with 
a sharp value of the energy. Today, most textbooks in 
statistical physics still convey the message that equiva¬ 
lence of ensembles holds universally for every physical 
system, justifying the use of energy and temperature as 
two different parameters giving an equivalent description. 

However, in the past decades various studies have high¬ 
lighted that ensemble equivalence breaks down in certain 
models of fluid turbulence @,01, quantum phase sep¬ 
aration ill, star formation [JQ, nuclear fragmenta¬ 


tion M , and networks [ij Physically, it is believed 
that nonequivalence is associated with l ong -range inter¬ 
actions or other forms of nonadditivity [14| . However, a 
complete theoretical understanding of the phenomenon is 
still missing. Mathematically, ensemble nonequivalence 
has been approached in various ways [Hill. In particu¬ 
lar, the microcanonical and canonical ensembles are said 
to be thermodynamically equivalent 0] when the entropy 
and the free energy of the system are one-to-one related 
via a Legendre transform. The ensembles are said to be 
macrostate equivalent 0 when the sets of equilibrium 
values of the macrostate (energy, magnetization, etc.) are 
the same. Finally, a recent and mathematically appealing 
definition is that of measure equivalence [l6| . according 
to which the ensembles are said to be equivalent when 
the canonical probability distribution converges to the 
microcanonical probability distribution in the thermody¬ 
namic limit. Under certain hypotheses, the three defini¬ 
tions have been shown to be equivalent [l6| . Moreover, 
large deviation theory 0 shows that the ensembles are 
nonequivalent on all three levels when the microcanon¬ 
ical specific entropy is nonconcave as a function of the 
energy density in the thermodynamic limit 161. 

Here we study ensemble nonequivalence for networks 
with topological constraints 18l-l20l|. Usually, ensemble 
nonequivalence is studied for systems in which the Boltz¬ 
mann distribution describes a certain physical interac¬ 
tion that is encapsulated in the energy. However, as al¬ 
ready shown by Jaynes 0 , the Boltzmann distribution 
describes much more general ensembles of systems with 
given constraints, namely, all solutions to the maximum- 
entropy problem of inference from partial information. 
We argue that, for any discrete enumeration problem 
where we need to count microcanonical configurations 
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compatible with a given constraint, there exists a ‘dual’ 
problem involving canonical configurations induced by 
the same constraint. We prove a general result relating 
measure equivalence to equivalence of the large deviation 
properties of microcanonical and canonical probabilities, 
and provide examples of networks that exhibit nonequiv¬ 
alence whenever the number of constraints is extensive. 
Ensembles. For N G N, let Gn denote the discrete set 
of all configurations with N particles (in the examples 
below, all graphs with N nodes). Let C denote a vector¬ 
valued function on Gn- The microcanonical distribution 
with hard constraint C* is defined as 


Pmic(G) = j 


if C(G) = C* 
else, 


( 1 ) 


where = |{G € Gn- G(G) = C*}| is the num¬ 
ber of configurations that realize C*. Following Jaynes 
21| . we introduce a ‘dual’ problem involving a canon¬ 
ical probability distribution Pcan(G) defined as the so¬ 
lution of the maximization of the Shannon entropy 
S‘jv(Tcan) — "^can(G) lnFl;;an(G) subject to the 

soft constraint (C) = C*, where (•) denotes the av¬ 
erage w.r.t. Pcan, and to the normalization condition 
T,GeGM -Pcan(G) = 1 [H. This givcs 


Pcan(G) = 


exp[-g(G,g-)] 

Z{0*) 


( 2 ) 


where H{G,9) = 9 ■ C{G) is the Hamiltonian and 
■^(^) = SceCAr 6xp[—iL(G, 0)] is the partition function. 
Note that in eq.jS]) the parameter 9 must be set to the 
particular value 9* that realizes (C) = C* [i^. This 
value also maximizes the likelihood |22| . 

Specific relative entropy and large deviations. The rela¬ 
tive entropy of Pmic w.r.t. Pcan is 

Pn,lc(G) 


•Sw (Pmic 11 Pcan) = 


E 

GgGn 


Pmic(G) In 


Pcan(G)- 


(3) 


Following 0, we say that the two ensembles are measure 
equivalent if their specific relative entropy is zero: 


s = lim 
N —>-oo 


(-^mic I l-fcan) 
N 


= 0 . 


(4) 


Before considering specific cases, we make a crucial 
observation. Noting from the form of H{G,9) that 
Pcan(Gi) = Pcan(G2) when G(Gi) = C(G 2 ) (the canon¬ 
ical probability is the same for all configurations with the 
same value of the constraint), we rewrite eq.(l3|) as 


(Pmic 11 Pcan) — In 


Pmic(G*) 

Pcan(G*)’ 


(5) 


where G* is any configuration in Gn such that C(G*) = 
C*. The equivalence condition in (jH) then becomes 

lim ^[lnP„,ic(G*)-lnPcan(G*)] =0, (6) 

N^oo TV- ' 


which demonstrates that nonequivalence coincides with 
Pmic(G*) and Pcan(G*) having different large deviation 
behavior 0. Importantly, this condition is purely local 
as it involves the microcanonical and canonical proba¬ 
bilities of a single microstate G* realizing the hard con¬ 
straint. This greatly simplifies previously studied global 
conditions involving almost all microstates 161. 
Unipartite networks. We now apply the above concepts 
to the class of unipartite graphs, where there is a single 
set of nodes among which all possible links are allowed. 

Let us first consider graphs with a fixed number of links 
L, i.e., C = L. Writing L = XV, where V = N{N — l)/2 
is the number of pairs of nodes and A is the fraction of 
realized links, in the microcanonical ensemble we have 


fir* = 


V 

x*v 


0 < A* < 1. 


(7) 


The canonical ensemble can be obtained from eq.(l2|) by 
setting H{G,9) = 9L{G) and p* = 


i+e- 


= A* 


This produces the Erdos-Renyi random graph where each 
pair of nodes is connected with equal probability p*: 

Pca„(G) = (p’^)^(°)(l (8) 

We can now compute the relative entropy from eq. ([5]) as 
5(Pmic||Pcan) = -A^FlnA* - (1 - A*)^ln(l - A*) 

- In = In v'27rA*(l-A*)F + 0(1/1/), (9) 

where we have used Stirling’s formula n\ = (n/e)"-\/27rn 
[1 -I- 0(l/n)], n —oo. This gives 


lnV2^A*(l-A*)F ^ 
s = hm —-—-= 0, 

N—¥oo N 


( 10 ) 


proving ensemble equivalence. In other words, when 
N ^ oo most graphs have a number of links that is 
close to the average number of links. 

We next consider graphs with a fixed degree sequence, 
i.e. C = k = (ki ,..., feAf) where ki is the number of links 
of node i. This is known as the configuration model [l8| . 
The microcanonical number is not known in general, 
but asymptotic results exist in the ‘sparse case’ where 


^max = max k^ = o{y/N). 

l<2<iV 


In this regime it is known that 


^fc* “ 




{k*'^/2k*f + \+o{N- 


( 11 ) 


( 12 ) 


i=l 

-IV-Af 


where k* = N ^ k* (average degree), L = Nk*/2 
(number of links), and k*'^ = Xl~^J2i=iki'^ (average 


square degree). The canonical ensemble is described [21 
by eq.© where H{G, 9) = 9 ■ k{G) and 9* is such that 


E 

1#* 




1 -I- e 




= kt Vz. 


(13) 
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Setting p* = e / (I + e ), we have 

Pca„(G) = (1 - > (14) 




where and is the entry of the adja¬ 

cency matrix of G. Eg. lfTTI) ensures that fcma v = o{'/L), 
a condition under which eq. da is solved as [20| 


h*h* 

g-Si-e* ^ 


2L* 


= o(l)> 


(15) 


where ^ means that the quotient tends to 1. This implies 
0* ^ — ln(fc*/\/2L*) and ln(l —p* ) ^ —k*k*/2L*. Thus 


N 


In Pcan(G*) ~ ln(2i*) - L^- (16) 

Combining eqs.®, (fT2)l and da, we obtain 


N 


5'(Pmic||f’can) ^ y^lng(/cf) 


\2 


(17) 


+ {k*^/2k*f - \ + o{N-^k* ), 


where q{k) = k\/{k/e)^ > ^/2-Kk for A: > 1. Eg. lfTTI) guar¬ 
antees that the terms in the last line are o{N). Denoting 
a limiting average over nodes with a bar, we arrive at 


s = In q{k*) > In y/2'Kk* > 0, 


(18) 


proving nonequivalence. In other words, when TV —^ oo 
most graphs in the canonical ensemble do not have a de¬ 
gree sequence that is close to the average degree sequence. 
This important result explains various recent findings, 
e.g. the fact that the canonical and microcanonical en¬ 
tropies of random regular graphs are different even in the 
thermodynamic limit [l^ and that canonical fluctuations 
do not vanish in networks with local constraints [l5| . 

As a first example we consider sparse regular networks, 
where every node has the same degree k* = o{'/N). 
Then Infc* = Infc*, so that ea. lTT^ becomes 

s > ln'\/27rfc*, k* = o{y/N). (19) 

Note that when k* grows with N, s diverges like Infc*, 
signalling an extreme violation of equivalence. 

As a second example we consider sparse scale-free net¬ 
works [2^, defined by a truncated power-law degree dis¬ 
tribution of the form Fj^{k) = N~^ l{fci=fc} = 

A^k~'^ withy € (l,oo) for 1 < fc < kc{N) andEjv(A:) = 0 
for k > kc{N), where lim^v-^oo kJN) = 00 and kc{N) = 
o{\/N). This ‘structural cut-off’ ensures eq. dlB , SO 
that eq. dI3 is valid. Approximating F^ik) by a contin¬ 
uous distribution, we see that the normalization of Fn 
implies Ri 7 — 1, and so ea. dT^ leads to 

s > In V^iFk* Ri — -— -I- In (20) 

2(7- 1) 


confirming nonequivalence. As the tail exponent 7 de¬ 
creases, the degree distribution broadens and the degree 
of violation of equivalence increases. 

Taken together, the above examples indicate that en¬ 
semble equivalence holds when there is a single global 
constraint, while it is broken when there is an exten¬ 
sive number of local constraints. They also indicate that 
graphs with local constraints are always nonequivalent, 
irrespective of the breadth of the degree distribution. 
Bipartite networks. We now consider bipartite networks, 
where there are two distinct sets of nodes, and links are 
allowed only between the two sets. A bipartite graph G 
is specified by an x M matrix, where N and M denote 
the numbers of nodes in the two sets. For simplicity, we 
constrain the topological properties on only one set (say, 
the one with N nodes) and regard the other set as an 
‘external environment’. Thus N is the size of the system 
and the criterion in eq.(|T|) still applies. For instance, we 
can think of our bipartite graph as a collaboration net¬ 
work of N articles and M authors. We may want to focus 
only on the properties of the set of articles, while regard¬ 
ing the ‘external’ set of authors fixed. In particular, in 
the limit N ^ 00 we may think of M as a fixed number 
(either finite or infinite). 

If we fix only the total number L of links, then the 
number of microcanonical configurations is still given 
by eq. 0 and the canonical probability, defined via 
F[{G,9) = 9L{G), is the same as in eq.lH]), where now 
V = NM and p* = e"®* is such that (L) = p*V = L*. A 
calculation similar to that leading to ea. ifTO)) shows that 
s = 0, proving again ensemble equivalence. 

We next fix the degree sequence k* = {ki,...,k^) 
of the constrained set. The microcanonical configu¬ 
rations are enumerated exactly as Dg* = 0^=1 i^*)- 
The canonical ensemble is defined by the Hamiltonian 
H{G, 9) = 9-k(G) and is still described by eg. (ITT)) , where 
n j,j = Hill n^i = kf/M [2^. We assume that 

0 < fc* < M for all i to avoid either disconnected nodes 
or fully connected nodes. A direct calculation yields 

s = In ■\y2T^F*fT--T*jM) (21) 

(where the bar again denotes a limiting average, now 
over the N nodes of the constrained set), proving en¬ 
semble nonequivalence. Note that here we have put no 
restriction on fcmax, apart from requiring 0 < fcmax < M. 
Indeed, while eg. (1181) is valid only in the sparse regime, 
ea. (l?T1) holds in the full range of connectivity. 
Irrelevance of (non) additivity. In the physics litera¬ 
ture a connection has been conjectured between ensem¬ 
ble nonequivalence and the nonadditivity of the energy, 
as induced for instance by long-range interactions |14j |. 
By contrast, we now show that in our examples the 
only mechanism leading to nonequivalence is the pres¬ 
ence of an extensive number of local constraints, irrespec¬ 
tive of (non) additivity. To this end, we partition the set 
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of N nodes into two sets Vi and V 2 with = aiN 
and N 2 = a 2 N nodes, respectively, where ai,a 2 > 0 
and ai + a 2 = 1. For a given graph G, we calcu¬ 
late the interaction energy between the two subsystems 
as i/int(G, 9) = H{G, 9) — 9) — i? 2 (G, 0), where 

Hi{G, 9) denotes the restriction of H{G, 9) to the set Vi. 

In our example of unipartite graphs with the single 
constraint C = L, the interaction energy is i/int(G,0) = 
9[L{G) — Li{G) — L 2 (G)], where Li{G) is the number of 
‘internal’ links among the nodes of Vi. Thus Hint{G,9) 
is proportional to the number of links between Vi and 
V 2 , and its expected value is 

(Ffint(0)) = 9^ [7V(7V - 1) - Ni{Ni - 1) - N 2 {N 2 - 1)] 

= 9^N‘^{l-al-al) = 9pN^aia2 ( 22 ) 

where p = e“®/(l -I- e~^) as in eq.®. In the thermody¬ 
namic limit, the ratio of {Hint {9)) to the expected total 
energy {HtotiS)) = 9pN{N - l)/2 is 


{Hintje)) 

{Htotm 


= 2Qfia2 > 0, 


(23) 


which proves nonadditivity due to long-range interactions 


14| . A similar result holds for the configuration model. 


So, unipartite networks are always nonadditive, irrespec¬ 
tive of whether they exhibit nonequivalence. 

By contrast, our examples of bipartite networks are 
always additive, irrespective of whether they exhibit 
(non)equivalence. This occurs because, when partition¬ 
ing the set of N nodes (e.g. articles in our previous exam¬ 
ple), both Vi and V 2 remain connected only to the M ‘ex¬ 
ternal’ nodes (e.g. authors), and not among themselves. 
Indeed, the Hamiltonian only couples the N nodes to the 
external environment and we always get Hint{G,9) = 0 . 
Conclusion. We found that (non)equivalence is deter¬ 
mined by an entirely local criterion involving the large- 
deviation behavior of microcanonical and canonical prob¬ 
abilities of a single microstate rather than of almost all 
microstates (as generally expected 0). This result is 
entirely general and is not restricted to networks. More¬ 
over, we found that the presence of an extensive number 
of local constraints provides a mechanism for ensemble 
nonequivalence in ensembles of graphs. While, in all ex¬ 
amples known so far, nonequivalence was expected to be 
associated with long-range interactions or nonadditivity, 
in our examples (non)equivalence is only determined by 
the number of constraints, irrespective of (non)additivity. 
From a practical point of view, graphs with local con¬ 
straints are routinely used as null models to detect em¬ 
pirical patterns or to reconstruct networks from partial 
information 0. [l8l - E50 . So far, choosing between mi¬ 
crocanonical and canonical implementations [l^ of these 
null models has been perceived as a mere matter of con¬ 
venience. However, our findings imply that one should 
make a careful and principled choice, as results obtained 


using different ensembles may differ substantially. The 
same considerations might extend to other ensembles of 
systems with many constraints, applications of which 
range from biology (e.g. conformational ensembles) to 
finance and neuroscience (e.g. time series ensembles). 
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